Reinforcement Learning-based Product Delivery Frequency Control

نویسندگان

چکیده

Frequency control is an important problem in modern recommender systems. It dictates the delivery frequency of recommendations to maintain product quality and efficiency. For example, delivering promotional notifications impacts daily metrics as well infrastructure resource consumption (e.g. CPU memory usage). There remain open questions on what objective we should optimize represent business values long term best, how balance between a dynamically fluctuating environment. We propose personalized methodology for problem, which combines long-term value optimization using reinforcement learning (RL) with robust volume technique termed "Effective Factor". demonstrate statistically significant improvement efficiency by our method several notification applications at scale billions users. To best knowledge, study represents first deep RL application such industrial scale.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that mitigate each of these curses. To handle the state-space explosion, we introduce “tabular linear functions” that generalize tile-coding and linear value functions. Action space complexity is reduced by replacing compl...

متن کامل

Reinforcement Learning Based PID Control of Wind Energy Conversion Systems

In this paper an adaptive PID controller for Wind Energy Conversion Systems (WECS) has been developed. Theadaptation technique applied to this controller is based on Reinforcement Learning (RL) theory. Nonlinearcharacteristics of wind variations as plant input, wind turbine structure and generator operational behaviordemand for high quality adaptive controller to ensure both robust stability an...

متن کامل

A Reinforcement Learning Approach for Product Delivery by Multiple Vehicles

Real-time delivery of products in the context of stochastic demands and multiple vehicles is a difficult problem, as it requires the joint investigation of the problems in inventory control and vehicle routing. We model this problem in the framework of Average-reward Reinforcement Learning (ARL) and present experimental results on a modelbased ARL algorithm called H-Learning with piecewise line...

متن کامل

Scaling Average-reward Reinforcement Learning for Product Delivery

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state space and action space, and high stochasticity. We give partial solutions to each of these curses that provide order-of-magnitude speedups in execution time over standard approaches. We demonstrate our methods in the domain of product delivery. We present experimental results on refinem...

متن کامل

Reinforcement Learning-based Quadcopter Control

Analysis of quadcopter dynamics and control is conducted. A linearized quadcopter system is controlled using modern techniques. A MATLAB quadcopter control toolbox is presented for rapid visualization of system response. Waypoint-based trajectory control of a quadcopter is performed and appended to the MATLAB toolbox. Finally, an investigation of control using reinforcement learning is conducted.

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i17.17803